Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
5671 | 330 | 1,5 |
6576 | 282 | 2,5 |
7885 | 232 | %, |
9610 | 187 | 3,5 |
10560 | 169 | 4,5 |
10993 | 162 | 0,5 |
12117 | 146 | 1,2 |
14913 | 115 | 1,3 |
16959 | 99 | 5,0 |
17355 | 96 | 1,6 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
26696 | 57 | Jetty(6.1.26 |
30953 | 47 | Ključa(-ne |
31037 | 47 | beseda(-e |
42164 | 31 | Menil(a |
43911 | 30 | s(m)o |
64795 | 17 | A1(4 |
84689 | 12 | možnih(1 |
86694 | 12 | »(2 |
86695 | 12 | »(4 |
89973 | 11 | možnih(4 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
12779 | 137 | %) |
13048 | 134 | 0)Moj |
15834 | 107 | %). |
21549 | 74 | %), |
30013 | 49 | IV.3.1)Referenčna |
30014 | 49 | IV.3.2)Prejšnje |
34271 | 41 | II.1.1)Naslov |
36955 | 37 | II.1.2)Vrsta |
38506 | 35 | 0)2 |
40664 | 33 | osebe)REGISTRACIJA |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
2401 | 757 | 100% |
4410 | 424 | 5% |
5269 | 353 | 50% |
5438 | 342 | %. |
6234 | 299 | 20% |
7080 | 262 | 10% |
7885 | 232 | %, |
8937 | 202 | 30% |
11949 | 148 | 80% |
12031 | 147 | 90% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
23367 | 67 | H&M |
35640 | 39 | S&P |
39432 | 34 | S&T |
43240 | 30 | Lisac&Lisac |
54039 | 22 | C&R |
78997 | 13 | Viator&Vektor |
99327 | 9 | AT&T |
100349 | 9 | M&M |
100959 | 9 | R&B |
101366 | 9 | U&S |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
37697 | 36 | 50$ |
50651 | 24 | $. |
73976 | 14 | $1.00 |
82800 | 12 | M$ |
98982 | 9 | $2.00 |
98983 | 9 | $5.00 |
106561 | 8 | $500 |
127238 | 6 | $, |
127239 | 6 | $100 |
127240 | 6 | $3.00 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
141924 | 5 | %" |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
30079 | 49 | You're |
35566 | 39 | I'm |
38548 | 35 | Don't |
41174 | 32 | L'Oreal |
48845 | 26 | rock'n'roll |
49158 | 25 | 12's |
54420 | 22 | Victoria's |
56398 | 21 | cee'd |
57944 | 20 | It's |
60592 | 19 | d'Iseru |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
355271 | 1 | 16%+22% |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
484 | 3000 | Napisal/-a |
1059 | 1610 | lighttpd/1.4.20 |
5812 | 321 | in/ali |
11741 | 151 | km/h |
14580 | 118 | Pridružen/-a |
15621 | 109 | 2012/13 |
15836 | 107 | 2013/14 |
17730 | 94 | napisal/-a |
18938 | 87 | Objavil/a |
19278 | 85 | 2011/2012 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
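The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the set `FORBIDDEN` treats every listed character as forbidden, and the threshold `MIN_FREQ` is a hypothetical value; both would have to be chosen per language and per corpus. The sample entries are taken from the tables in this section.

```python
# Hypothetical settings: which characters count as forbidden and what
# "very low frequency" means depend on the language and the corpus.
FORBIDDEN = set(',()%&$"\'+*=/_')
MIN_FREQ = 100


def suspicious(entries, forbidden=FORBIDDEN, min_freq=MIN_FREQ):
    """Return (freq, word) pairs that contain a forbidden character
    and still reach the frequency threshold."""
    return [
        (freq, word)
        for freq, word in entries
        if freq >= min_freq and any(ch in forbidden for ch in word)
    ]


# A few (frequency, word) pairs copied from the tables in this section.
sample = [
    (3000, "Napisal/-a"),
    (1610, "lighttpd/1.4.20"),
    (757, "100%"),
    (9, "AT&T"),
    (1, "16%+22%"),
]

flagged = suspicious(sample)
for freq, word in flagged:
    print(freq, word)
```

With these assumed settings, only the three high-frequency entries are reported; rare items such as `AT&T` fall below the threshold and are ignored.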
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
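The same query can be repeated for every character in the list. The sketch below does this with Python's `sqlite3` module and an in-memory table, purely for illustration. It assumes a table `words(w_id, word, freq)` as in the query above. SQLite needs an explicit `ESCAPE` clause, whereas the query above relies on the backslash being the default escape character. The `ORDER BY w_id` clause is an addition for deterministic output, and the rows marked as hypothetical are invented for the demonstration.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Build a LIKE pattern matching words that contain ch literally."""
    # % and _ are LIKE wildcards, and the backslash is our escape
    # character, so all three must be escaped.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY added here for determinism
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (50, "%", 9999),        # hypothetical row with w_id <= 100, filtered out
        (200, "beseda", 5000),  # hypothetical plain word, never matched
        (2501, "100%", 757),    # remaining rows follow the tables above
        (4510, "5%", 424),
        (11841, "km/h", 151),
        (23467, "H&M", 67),
    ],
)

for ch in SPECIAL_CHARS:
    print(ch, words_containing(conn, ch))
```

Escaping matters most for `%` and `_`: without it, the pattern for `_` would match every word of at least one character rather than only words containing an underscore.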